High-speed Cluster Interconnects Myrinet, Quadrics, and InfiniBand Achieve Low Latency and High Bandwidth with Low Host Overhead. However, They Show Quite Different Performance Behaviors When Handling Communication Buffer Reuse

Authors

  • Jiuxing Liu
  • Balasubramanian Chandrasekaran
  • Weikuan Yu
  • Jiesheng Wu
  • Darius Buntinas
  • Sushmitha Kini
  • Dhabaleswar K. Panda
  • Pete Wyckoff
Abstract

Today’s distributed and high-performance applications require both high computational power and high communication performance. The computational power of commodity PCs has been doubling about every 18 months. At the same time, network interconnects that provide very low latency and very high bandwidth are emerging. Together, these trends make clustering a promising way to build high-performance computing environments: it combines the computational power of commodity PCs with the communication performance of high-speed network interconnects.

Several network interconnects provide low latency (less than 10 μs) and high bandwidth (several gigabytes per second). Two of the leading products are Myrinet and Quadrics, and InfiniBand has recently entered the high-performance computing market. The three interconnects share some similarities: they provide user-level access to network interface cards for communication, and they support access to remote processes’ memory address spaces. They also differ in many ways, which raises the question of how to conduct a meaningful performance comparison among them.

Traditionally, researchers have used simple microbenchmarks, such as latency and bandwidth tests, to characterize a network interconnect’s communication performance. Later, they proposed more sophisticated models such as LogP. However, these tests and models target general parallel computing systems and do not address many features present in these emerging commercial interconnects. Another way to evaluate network interconnects is to use real-world applications. However, real applications usually run on top of a middleware layer such as the Message Passing Interface (MPI). Application-level performance therefore reflects not only the capability of the network interconnects, but also the quality of the MPI implementations and the design choices of the MPI implementers.
Thus, to provide more insight into ...

Similar resources

Micro-benchmark level performance comparison of high-speed cluster interconnects

In this paper, we present a comprehensive performance evaluation of three high speed cluster interconnects: InfiniBand, Myrinet and Quadrics. We propose a set of microbenchmarks to characterize different performance aspects of these interconnects. Our micro-benchmark suite includes not only traditional tests and performance parameters, but also those specifically tailored to the interconnects’ ...

Reducing Diff Overhead in Software DSM Systems using RDMA Operations in InfiniBand

Software DSM systems do not perform well because of the combined effects of increased communication, slow networks, and the large overhead associated with processing the coherence protocol. Modern interconnects like Myrinet, Quadrics and InfiniBand offer reliable, low latency (around 5.0 μs point-to-point), and high-bandwidth (up to 10.0 Gbps in 4X InfiniBand). These networks also support effici...

The Quadrics Network Extends the Native Operating System in Processing Nodes with a Network Operating System and Specialized Hardware Support in the Network

The interconnection network and its associated software libraries are critical components for high-performance cluster computers and supercomputers, Web-server farms, and network-attached storage. Such components will greatly impact the design, architecture, and use of future systems. Key solutions in high-speed interconnects include Gigabit Ethernet, GigaNet, the Scalable Coherent Interface (S...

Performance Evaluation of Distributed Computing over Heterogeneous Networks

RWAPI is a low-level communication interface designed for clusters of PCs. It has been developed to provide performance to higher applications on a wide variety of architectures. We implemented RWAPI on top of the modular software architecture called GRWA. RWAPI supports Ethernet, InfiniBand and Myrinet network interconnects. This paper introduces RWAPI and the design of its network component o...

A New Credit-Based End-to-End Flow Control Protocol for High Performance Interconnects

Master’s thesis by Javier Prades Gasulla, Department of Computer Engineering, Master’s Degree in Computer and Network Engineering. High Performance Computing usually leverages messaging libraries such as MPI, GASNet, or OpenSHMEM, among others, in order to exchange data among processes in large-scale clusters. Furthermore, thes...

Publication date: 2004